First steps towards autonomous recognition of Monterey Bay’s most common mid-water organisms: Mining the ROV video database on behalf of the Automated Visual Event Detection (AVED) system
نویسندگان
چکیده
The development of remotely operated vehicles (ROVs) has revolutionized the world of marine science by allowing quantitative video transects (QVTs) to be recorded underwater. This non-invasive technique allows previously unavailable information to be obtained concerning organism diversity, distribution, and abundance. Unfortunately, processing of these QVTs at the Monterey Bay Aquarium Research Institute (MBARI) is a lengthy and tedious process that prevents rapid data analysis. The development of a neuromorphic computer vision system is currently in progress to solve this problem of the data processing bottleneck. The proposed computer vision system combines a saliency-based attentional module and a recognition module, both designed after the human visual system. The “saliency” model mimics the largely unconscious “bottomup” search mechanism of primates, responding specifically to visual cues in the surrounding environment. The recognition module involves learning of specific targets and requires “top-down” biasing based on learned traits. The attentional module at MBARI can already determine areas of potential interest in the marine environment. However, in order to begin training the computer system for the recognition module, a test database of representative organisms must first be established. The extensive MBARI database was mined to reveal the relative abundance and frequency of annotations of all organisms in the Monterey Bay. Based on the rankings, representative video clips were compiled of the top 25 most common mid-water organisms as a preliminary test database for the recognition module.
منابع مشابه
Recognition of Visual Events using Spatio-Temporal Information of the Video Signal
Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...
متن کاملA Detection, Tracking, and Classification System for Underwater Images
Traditional underwater images for ecology study often include time-lapse or brief video recordings over short periods of time. More recently, with the installation of under water cabled observatories that provide constant electrical power and data connections to the seafloor, longterm video recordings are possible for the first time. These valuable recordings are essential for underwater ecolog...
متن کاملFace Detection with methods based on color by using Artificial Neural Network
The face Detection methodsis used in order to provide security. The mentioned methods problems are that it cannot be categorized because of the great differences and varieties in the face of individuals. In this paper, face Detection methods has been presented for overcoming upon these problems based on skin color datum. The researcher gathered a face database of 30 individuals consisting of ov...
متن کاملFace Detection at the Low Light Environments
Today, with the advancement of technology, the use of tools for extracting information from video are much wider in terms of both visual power and the processing power. High-speed car, perfect detection accuracy, business diversity in the fields of medical, home appliances, smart cars, humanoid robots, military systems and the commercialization makes these systems cost effective. Among the most...
متن کاملAction Change Detection in Video Based on HOG
Background and Objectives: Action recognition, as the processes of labeling an unknown action of a query video, is a challenging problem, due to the event complexity, variations in imaging conditions, and intra- and inter-individual action-variability. A number of solutions proposed to solve action recognition problem. Many of these frameworks suppose that each video sequence includes only one ...
متن کامل